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(57) Abstract: Techniques are described for space-time block coding for single-carrier block transmissions over frequency selec- 
tive multipath fading channels Techniques are described that achieve a maximum diversity of order N,N r (L + 1) in rich scattering 
environments, where N t (N r ) is the number of transmit (receive) antennas, and L is the order of the finite impulse responde (FIR) 
channels. The techniques may include parsing a stream of information-bearing symbols to form blocks of K symbols, precoding the 
symbols to form blocks having J symbols, and collecting consecutive N s blocks, generating a space-time block coded matrix having 
N t rows that are communicated through a wireless communication medium. The receiver complexity is comparable to single antenna 
transmissions, and the exact Viterbi's algorithm can be applied for maximum-likelihood (ML) optimal decoding. 
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SPACE-TIME CODED TRANSMISSIONS WITHIN A WIRELESS 
COMMUNICATION NETWORK 

This invention was made with government support under ECS-9979443 
awarded by the National Science Foundation. The government has certain rights in 
the invention. 

TECHNICAL FIELD 

The invention relates to communication systems and, more particularly, 
multiple-antennae transmitters and receivers for use in wireless communication 
systems. 

BACKGROUND 

Space-time (ST) coding using multiple transmit-antennae has been 
recognized as an attractive means of achieving high data rate transmissions with 
diversity and coding gains in wireless applications. However, ST codes are 
typically designed for frequency flat channels. Future broadband wireless systems 
will likely communicate symbols with duration smaller than the channel delay 
spread, which gives rise to frequency selective propagation effects. When 
targeting broadband wireless applications, it is important to design ST codes in the 
presence of frequency selective multipath channels. Unlike flat fading channels, 
optimal design of ST codes for dispersive multipath channels is complex because 
signals from different antennas are mixed not only in space, but also in time. In 
order to maintain decoding simplicity and take advantage of existing ST coding 
designs for flat fading channels, most conventional techniques have pursued two- 
step approaches. In particular, the techniques mitigate intersymbol interference 
(ISI) by converting frequency selective fading channels to flat fading channels, and 
then design ST coders and decoders for the resulting flat fading channels. One 
approach to ISI mitigation has been to employ a relatively complex multiple-input 
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multiple-output equalizer (MIMO-EQ) at the receiver to turn FIR channels into 
temporal ISI-free ones. 

Another approach, with lower receiver complexity, is to employ orthogonal 
frequency division multiplexing (OFDM), which converts frequency selective 
multipath channels into a set of flat fading subchannels through inverse Fast 
Fourier Transform (FFT) and cyclic prefix (CP) insertion at the transmitter, 
together with CP removal and FFT processing at the receiver. On the flat fading 
OFDM subchannels, many techniques have applied ST coding for transmissions 
over frequency-selective channels. Some of these assume channel knowledge, 
while others require no channel knowledge at the transmitter. 

Although using ST codes designed for flat fading channels can at least 
achieve full multi-antenna diversity, the potential diversity gains embedded in 
multipath propagation have not been addressed thoroughly. OFDM based systems 
are able to achieve both multi-antenna and multipath diversity gains of order equal 
to the product of the number of transmit-antennas, the number of receive-antennas, 
and the number of FIR channel taps. However, code designs that guarantee full 
exploitation of the embedded diversity have not been explored. A simple design 
achieves full diversity, but it is essentially a repeated transmission, which decreases 
the transmission rate considerably. On the other hand, for single antenna 
transmissions, it has been shown that a diversity order equal to the number of FIR 
taps is achievable when OFDM transmissions are linearly precoded across 
subcarriers. An inherent limitation of multicarrier (OFDM) based ST transmissions 
is a non-constant modulus, which necessitates power amplifier back-off, and thus 
reduces power efficiency. In addition, multi-carrier schemes are more sensitive to 
carrier frequency offsets relative to their single-carrier counterparts. 

SUMMARY 

In general, the invention is directed to space-time block coding techniques 
for single carrier block transmissions in the presence of frequency-selective fading 
channels. Furthermore, in accordance with the techniques, a maximum diversity 
up to order N t N r (L + 1 ) can be achieved in a rich scattering environment, where N t 
is the number of transmit antennas, N r is the number of receive antennas, and (L + 
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1) is the number of taps corresponding to each FIR channel. The techniques enable 
simple linear processing to collect full antenna diversity, and incur receiver 
complexity that is comparable to single antenna transmissions. Notably, the 
transmissions enable exact application of Viterbi's algorithm for maximum- 
likelihood (ML) optimal decoding, in addition to various reduced-complexity sub- 
optimal equalization alternatives. When the transmissions are combined with 
channel coding, they facilitate application of iterative (turbo) equalizers. 
Simulation results demonstrate that joint exploitation of space-multipath diversity 
leads to significantly improved performance in the presence of frequency selective 
multipath channels. 

In one embodiment, a method may comprise applying a permutation matrix 
to blocks of symbols of an outbound data stream, and generating transmission 
signals from the permutated blocks of symbols. The method may further comprise 
communicating the transmission signals through a wireless communication 
medium. 

In another embodiment, a method may comprise parsing a stream of 
information-bearing symbols to form blocks of K symbols, precoding the symbols 
to form blocks having J symbols, and collecting consecutive N s blocks. The 
method may further comprise applying a permutation matrix to the N s blocks, 
generating a space-time block coded matrix having N t rows, each row containing 
Nd*J symbols, generating N t transmission signals from the symbols of the N t rows, 
and communicating the N t transmission signals through a wireless communication 
medium. 

In another embodiment, a transmitting device may comprise an encoder to 
apply a permutation matrix to blocks of information bearing symbols and to 
generate a space-time block coded matrix of the permutated blocks of symbols. 
The transmitting device further comprises a plurality of pulse shaping units to 
generate a plurality of transmission signals from the symbols of the space-time 
block coded matrix, and a plurality of antennae to communicate the transmission 
signals through a wireless communication medium. 

The details of one or more embodiments of the invention are set forth in the 
accompanying drawings and the description below. Other features, objects, and 
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advantages of the invention will be apparent from the description and drawings, 
and from the claims. 

BRIEF DESCRIPTION OF DRAWINGS 

FIG. 1 is a block diagram illustrating a wireless communication system in 
which a transmitter communicates with a receiver through a wireless channel using 
space-time coded transmissions. 

FIGS. 2 - 3 are timing diagrams illustrating the transmitted sequences from 
the antennas of the transmitter of FIG. 1. 

FIG. 4 is another example transmission format for the transmitter of FIG. 1 . 

FIG. 5 is an example communication system using channel coding with the 
space-time coded transmission techniques in accordance with the principles of the 
invention. 

FIGS. 6 - 9 are graphs illustrating simulated performance results for 
systems with two transmit and one receive antenna. 

DETAILED DESCRIPTION 

The Detailed Description is organized as follows: Section I deals with the 
special case in which a system includes of a receiver having a single antenna, and 
transmitter having two transmit antennas. Section II details the equalization and 
decoding designs. Section III generalizes the proposed schemes to multiple 
transmit- and receive- antennas. Simulation results are presented in Section IV. 

Throughout the Detailed Description, bold upper letters denote matrices, 
bold lower letters stand for column vectors; (•) *, (•) T and (•) H denote conjugate, 
transpose, and Hermitian transpose, respectively; E{«} for expectation, tr{«} for 
the trace of a matrix, | • |j for the Euclidean norm of a vector; Ik denotes the 
identity matrix of size K, Omxn (1 mxn) denotes an all-zero (all-one) matrix with size 
MxN,and F N denotes an NXN FFT matrix with the (p+1 ; q+l)st entry of: 
(l/VA0exp(^27rpy/A0, Vp,q <5 [0,.Y- lj. 

diag(x) stands for a diagonal matrix with x on its diagonal. [«] p denotes the (p+l)st 
entry of a vector, and [•] M denotes the (p+ 1 ; q+1 )st entry of a matrix. 
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I. SINGLE CARRIER BLOCK TRANSMISSIONS 

FIG. 1 is a block diagram illustrating a wireless communication system 2 in 
which a transmitter 4 communicates with a receiver 6 through a wireless 
communication channel 8. In particular, FIG 1 illustrates the discrete-time 
equivalent baseband model in which transmitter 4 transmits a data with two 
transmit antennas (N t = 2), and receiver 6 receives data with a single receive 
antenna (N r = 1). Transmitter 4 includes a precoder 1 1, an encoder 12, two pulse 
shaping units 13 for generating transmission signals, and two transmission 
antennae 14. 

The information-bearing data symbols d(n) belonging to an alphabet A are 
first parsed to K x 1 blocks d(i) := [d(iK); . . . ; d(iK + K-l)] T , where the serial 
index n is related to the block index i by: n = iK + k; k e [0;K - 1]. The blocks d(i) 
are precoded by a J x K matrix © (with entries in the complex field) to yield J x 1 
symbol blocks: s(i) := 0d(i). The linear precoding by 0 can be either non- 
redundant with J = K, or, redundant when J > K. The ST encoder takes as input 
two consecutive blocks s(2i) and s(2i + 1) to output the following 2 J x 2 space-time 
block coded matrix: 

S L (2r) 

§2(20 S2(2i +1) 

m t 

where P is a permutation matrix that is drawn from a set of permutation matrices 
{pj n) }^ , with J denoting the dimensionality J x J. Each performs a reverse cyclic 
shift (that depends on n) when applied to a J x 1 vector a := [a(0); a( 1 );...; a(J -1)] 
T . Specifically, [ a] p =a((J-p+n) mod J). Two special cases are Pj 0) and Pj l) . 
The output of Pj Q) a = [a(J - 1); a(J - 2); . . .; a(0)] T performs time-reversal of a, 
while Pj n a = [a(0); a(J - 1); a(J - 2); ... ; a(l)] T = F y H) Fj H) = F} H) Fj H) a 

corresponds to taking the J-point IFFT twice on the vector a. This double IFFT 
operation in the ST coded matrix is in fact a special case of a Z-transform approach 
originally proposed in Z. Liu and G. B. Giannakis, "Space-time coding with 
transmit antennas for multiple access regardless of frequency-selective multi-path," 



s(2*) -Ps*(2* + 1) 
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in Proc. of Sensor Array and Multichannel Signal Processing Workshop, Boston, 
MA, Mar. 2000, pp. 178-182., with theZ-domain points chosen to be equally 

spaced on the unit circle: {e^^^J^-g. The techniques herein allow for any P 

from the set {P^}^ 1 . 

At each block transmission time interval i, the blocks si(i) and S2(i) are 
forwarded to the first and the second antennae of transmitter 4, respectively. From 
equation (1), we have: 

Sl(2* + 1) = -PSJ(»), S2(2i + 1) = Pst(2i), (2) 

which shows that each transmitted block from one antenna at time slot 2i+l is a 
conjugated and permuted version of the corresponding transmitted block from the 
other antenna at time slot 2i (with a possible sign change). For flat fading channels, 
symbol blocking is unnecessary, i.e., J = K = 1 and P = 1, and the design of (1) 
reduces to the Alamouti ST code matrix. However, for frequency selective 
multipath channels, the permutation matrix P is necessary as will be clarified soon. 

To avoid inter-block interference in the presence of frequency selective 
multipath channels, transmitter 4 insert a cyclic prefix for each block before 
transmission. Mathematically, at each antenna \ie [1, 2], a tall P X J transmit- 
matrixT cp ;= p2p,f/] r , with Ic P comprising the last P -J rows of Ij, is applied 

on S/i(£) to obtain Px 1 blocks: U^(£) = TcpSp(l). Indeed, multiplying T cp 
with 5 fL {i) replicates the last P - L entries of s /4 (i) and places them on its top. The 
transmitted sequences from both antennas of transmitter 4 are depicted in FIG. 2. 

With symbol rate sampling, h H := [h M (0); . . . ; h M (L)] T be the equivalent 
discrete-time channel impulse response (that includes transmit-receive filters as 
well as multipath effects) between the jxth transmit antenna and the single receive 
antenna, where L is the channel order. With the CP length at least as long as the 
channel order, P - J = L, the inter block interference (IB I) can be avoided at the 
receiver by discarding the received samples corresponding to the cyclic prefix. CP 
insertion at the transmitter together with CP removal at the receiver yields the 
following channel input-output relationship in matrix- vector form: x(i) 
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x(0«£fiA®+w(i), P) 

where the channel matrix is circulant with 

P^*]pi? = fy<((P — ^) ^)> and the additive Gaussian noise w(i) 

is assumed to be white with each entry having variance 

Receiver 6 can exploit the following two properties of circulant matrices: 
pi) Circulant matrices can be diagonalized by FFT operations 

H^ = F?D(h^)F, and H* = F?D(h;)F/, (4) 

where 

D&) := diag(h^), and ^ := [H^\ . . . , H^f^f 

with the pth entry being the channel frequency response 

evaluated at the frequency 

H H T 

p2) Pre- and post- multiplying f by P yields ^ : 

PH^P^Hj: and Pfi;P = Hjf. (5) 

With the ST coded blocks satisfying (2), let us consider two consecutive received 
blocks [c.f. (3)]: 

x(2i) = HiSi (2i) + H 2 52(2i) + w(2i), (6) 

x(2i + 1) = -HiPSj(2i) + H 2 PSft2i) + w(2i + 1). (7) 
Left-multiplying (7) by P, conjugating, and using p2), we arrive at: 

Px*(2i + 1) = -H? h{2i) + fi?5i(2i) + Pw'(2i + 1). (8) 
Notice that having permutation matrix P inserted at the transmitter allows the 
Hermitian of the channel matrices in (8) for enabling multi-antenna diversity gains 
with linear receiver processing. 
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We will pursue frequency-domain processing of the received blocks, which we describe by multiplying 
the blocks x(i) with the FFT matrix Fj that implements the /-point FFT of the entries in x(i). Let us 
define y(2i) := F,x(2i),y'(2*+1) := FyPx^&'+lJ.and likewise ^(2i) ;= F,w(2*)andfr(2i+1) := 
F^Pw ¥ (2i + 1). For notalional convenience, we also define the diagonal matrices T>i := D(hi) and 
V-z := D(hg) with the corresponding transfer function FFT samples on their diagonals. Applying the 
property pi) on (6) and (8), we obtain the FFT processed blocks as: 



y(2i) = ViFMX) + P 2 F^(2i) 
y'(2i-hl) = -PIF,52(20 +PJF,§ l (2i) + ^(2*+!). 



(9) 
(10) 



It is important to remark at this point that permutation, conjugation, and FFT operations on the received 
blocks x(i) do not introduce any information loss, or color the additive noises in (9) and (10) that remain 
white. It is thus sufficient to rely only on the FFT processed blocks y(2t) and y*(2* + 1) when performing 
symbol detection. 

After defining y(i) [y 7 (2i),y w (2i + l)] r , we can combine (9) and (10) into a single block matrix- 
vector form to obtain: 



y(i) = 







F,s(2>) 


+ 








F,s(2» + 1) 




r(2i + l) 



(H) 



:=!> 



where the identities Si(2t) = s(2i) and §2(20 = s(2i + 1) have been used following our design in (1). 

Consider a / x / diagonal matrix 2> L2 with non-negative diagonal entries as: = [T>lVi + 
1>£2> 2 ] 1/2 . We can verify that the matrix V in (1 1) satisfies 2>*1> = I 2 <g> 2%, where <g> stands for 
Kronecker product. Based on V\ and 1> 2 , next construct a unitary matrix U, If hi and ha do not 
share common zeros on the FFT grid {e^*}^, then 2> i2 is invertible, and we select U as U := 
2>(I 2 8 2>i 2 ). If hi and h 2 happen to share common zero(s) on the FFT grid (although this event has 
probability zero), then we construct U as follows. Supposing without of loss of generality that hi and 
h 2 share a common zero at the first subcarrier we have that [2>i]u = [2>a]w - [AJu = 0. We 
then construct a diagonal matrix Jf t which differs from T>i only at the first diagonal entry: = 1. 

Similar to the definition of V and £ l2 , we construct V and by substituting X> t with V* v Because 
2>i 2 is invertible, we form U := T^^i ® (Z>i£)~ 1 ]- In summary, no matter whether 2> l2 is invertible or 
not, we can always construct a unitary U, which satisfies U^U = I 2 / and U^2> = I 2 # i> i2 , where 
the latter can be easily verified. As multiplying by unitary matrices does not incur any loss of decoding 
optimality in the presence of additive white Gaussian noise, (1 1 ) yields z(i) ;= [z T (2i) t z r (2i + l)] r as: 



2(i) = U K y(i)^ 



^i2F /S (2») 




' fK2i) 


t> u F.,s(2» + l) 




^(2i + l) 



(12) 



where the resulting noise := [fj"'(2i),»r''(2i + l)] 1 ' - U w [*J 5r (2i) I ^ M (2i + l)] r is still white with 
each entry having variance A r 0 . 
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We infer from (12) that the blocks s(2i) and s(2i + 1) can be demodulated separately without compro- 
mising the ML optimality, after linear receiver processing. Indeed, so far we applied at the receiver three 
linear unitary operations after the CP removal: i) permutation (via P); ii) conjugation and FFT (via F;); 
and iii) unitary combining (via U w ). As a result, we only need to demodulate each information block 
d[i) separately from the following sub-blocks [c.f. (12)]: 

2(i) = SaFjsW + t#) = £i 2 F,9d(i) + t,(i). (13) 

A. Diversity gain analysis 

Let us drop the block index % from (13), and e.g., use d to denote d(i) for notational brevity. With 
perfect CSI at the receiver, wc will consider the pairwise error probability (PEP) P(d -> d'|hi, hg) that 
the symbol block d is transmitted, but is erroneously decoded as d' ^ d. The PEP can be approximated 
using the ChernolT bound as 

P{* -> *K ha) < «p(-*(ft z')/4JV 0 ) ? (14) 

where d(z t z') denotes the Euclidean distance between z and z'. 

Define the error vector as e := d - d r , and a J x (£+1) Vandermonde matrix V with [V] P!q = 
Qx.l>(-j2iqxi/J). The matrix V links the channel frequency response with the time-domain channel taps 
as ii h = Vh^. Starting with (13). we then express the distance as: 

ct 2 (z,z') = ||l>uF^eeH 2 = e*0*F*^F,0e 

= ^ll^^ll 2 =Ell^Vlvl| 2 , (15) 

where D c := diag(Fj0e) such that 7> h FjOe = D c b^ = D c \\. 

We focus on block quasi static channels, i.e., channels that remain invariant over each space-time 
coded block, but may vary from one block to the next. We further adopt the following assumption: 
asO) the channels hi and lx 2 are uncor related; and for each antenna f t € [1,2], the channel is 
zero-mean, complex Gaussian distributed, with covariance matrix := £{h,ili*}. 

if the entries of are i.i.d., then wc have = I^i/(L + 1), where the channel covariance 
matrix is normalized to have unit energy; i.e., tr{Rft^} = L Because general frequency selective 
muliipath channels have covariance matrices with arbitrary rank, we define the "effective channel order" 
as: = rank(R/^) - 1. Consider now the following cigen decomposition: 

IW = U V A^U&, (16) 

where A/^ is an x diagonal matrix with the positive eigenvalues of on its diagonal, 
and Uj^, is an (L -h 1) x (L A 4- 1) matrix having orthonormal columns: Uj^U^ = Defining 
h, 4 = Aj^Uj^tap, we can verify that the entries of are i.i.d. with unit variance. Since and 
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JJ^A^hp have ideniical distributions, we replace the former by the latter in the ensuing PEP analysis. 
A special case of interest corresponds to transmissions experiencing channels with full rank correlation 
matrices; i.e., rank(R/ wl ) = L +■ 1 and — L. As will be clear later on, a rich scattering environment 
leads to R^'s with full rank, which is favorable in broadband wireless applications because it is also 
rich in diversity. 

With the aid of the whitened and normalized channel vector we can simplify (15) to: 

rf 2 (z,z') - IIDcVU^A^ElII 2 + ||D C VU M A^E 2 || 2 . (17) 

From the spectral decomposition of the matrix Aj^A^, where A^ := DcVU/^A,^, we know that 
there exists a unitary matrix U,^ such that U^A^A^U^ = A^, where A^ is diagonal with 
non-increasing diagonal entries collected in the vector A^ := [A^O), A^(l) t . . . , A<y t (£ /t )] 7 '. 

Consider now the channel vectors := U^E/i, with identity correlation matrix. The vector R h is 
clearly zero-mean, complex Gaussian, with i.i.d entries. Using fij,, we can rewrite (17) as: 

d\zj) = A* = f^KMMW + (is) 

Based on (18), and by averaging (14) with respect to the i.i.d. Rayleigh random variables 1^(0 L 
we can upper bound the average PEP as follows: 

If r^ t is the rank of A^ (and thus the rank of A^A^X then A^(0 ^ 0 if and only iff 6 [0 t - 1]. 
It thus follows from (19) that 

n***)<{±) (n^wn^oj • (20) 

We call r € := r Cfl + the diversity gain and [n^ 1 ICS" 1 MO] ^ *e 
coding gain C?^. of the system for a given symbol error vector e. The diversity gain determines the 
slope of ihe averaged (wxt. the random channel) PEP (between s and s') as a function of the signal to 
noise ratio (SNR) at high SNR (iVjj -* 0). Correspondingly, C Cfi determines the shift of this PEP curve in 
SNR relative to a benchmark error rate curve of [l/(4JVo)]~ fc . Without relying on PEP 
to design (nonlinear) ST codes for flat fading channels, we here invoke PEP bounds to prove diversity 
properties of our proposed single-carrier block transmissions over frequency selective channels. 
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Since both G^ and Gc* depend on the choice of e (thus on $ and &\ we define the diversity and 
coding gains for our system, respectively, as: 

G d := mmGte, and G v :=iumC^ (*0 

Based on (21), one can check both diversity and coding gains. However, in this paper, we focus only 
on the diversity gain. First, we observe that the matrix A*< is square of size (£ A +■ 1). Therefore, 
the maximum achievable diversity gain in a two transmit- and one receive-antennae system is G4 — 
]CjLi& + 1) f° r ™ channels with effective channel order £, tl \i = 1,2, while it becomes 2(L + 1) in 
rich scattering environments. This maximum diversity can be easily achieved by e.g., a simple redundant 
transmission where each antenna transmits the same symbol followed by L zeros in two non-overlapping 
time slots. We next examine the achieved diversity levels in our following proposed schemes, which 
certainly have much higher rate than redundant transmissions. 

B. CP-only 

We term CP-only the block transmissions with no precoding: © = I A % / =* if, and s(i) — d(i). The 
word "only" emphasizes that, unlike OFDM, no IFFT is applied at the transmitter. Let us now check the 
diversity order achieved by CP-only. The worst case is to select d - ol/xi and d' = oHjxi implying 
e a (a - a')l /xi , where a, a f € A Verifying that for these error events, the matrix D c » diag(F 7 e) 
has only one non-zero entry, we deduce that r Ct \ — r tf2 — L Therefore, the system diversity order 
achieved by CP-only is G* = Z This is nothing but space-diversity of order two coming from the two 
transmit antennas [c.f. ( 1 3)]. Note that CP-only schemes suffer from loss of multipart! 
diversity. 

To benefit also from the embedded multipath-induced diversity, we have to modify our transmissions. 
C Linearly Precoded CP-only 

To increase our ST system's diversity order, transmitter 4 may utilize linear 
precoding developed originally for single-antenna transmissions. One can view 
CP-only as a special case of the linearly precoded CP-only system (denoted 
henceforth as LP-CP-only) with identity precoder. With s(i) = ©d(i) and carefully 
designed 0 * I K , we next show that the maximum diversity is achieved. We will 
discuss two cases: the first one introduces no redundancy because it uses J = K, 
while the second one is redundant and adopts J = K + L. For non-redundant 
precoding with J = K, it has been established that for any signal constellation 
adhering to a finite alphabet, there always exists a K x K unitary constellation 
rotating (CR) matrix ensuring ® C r that each entry of ©cr (d - d ' ) is non-zero for 
any pair of (d, d ' ). We thus propose to 
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construct 0 = F#© a such that F A © = ©,*. With this construction, D c = diag^e) is guaranteed to 
have non-zero entries on its diagonal, and thus it has full rank. Consequently, the matrix D C V has full 
column rank L+i, and A^e = DJW^A.^ has full column rank = ip+h Hence, the maximum 
achievable diversity order is indeed achieved. 

We emphasize here that the non-redundant precoder © cr is constellation dependent. For commonly 
used BPSK, QPSK, and all QAMs constellations, and for the block size K equal to a power of 2: 
K = 2 m , one class of © CT precoders with lai^e coding gains is found to be : 

© CT ~F,vA(a), and thus, © = A(ct), (22) 

where A(cv) ;= diag(l,a,...,a A '" 1 ) with a € (cW 1 ^}^ 1 , For block size A" 2\ one can 
construct © CT by truncating a larger unitary matrix constructed as in (22) . The price paid for our 
increased diversity gain is that LP-CP-only does not offer constant modulus transmissions, in general. 
However, by designing K to be a power of 2, and by choosing © as in (22), the transmitted signals 
s(*) = A(a)d(i) arc constant modulus if d(i) are PSK signals. Therefore, by selecting A' to be a power 
of 2, we can increase the diversity gain without reducing power efficiency. 

Alternatively, we can adopt a redundant / x K precoder © with / m K+L. Our criterion for selecting 
such tall precoding matrices © is to guarantee that Fj& satisfies the following property: any K rows 
of F/© are linearly independent. One class of F,© satisfying this property includes Vandcrmonde 
matrices ©vnn with distinct generators \pi, . , . ,^J, defined as : 



e _ 1 



i pi 1 pj 



and thus, ©=F?© wn . (23) 



With F/© s=s ©van, we have that © wn e has at least (L + 1) nonzero entries for any e regardless of the 
underlying signal constellation. Indeed, if 0 van e has only L nonzero entries for some e, then it has K 
zero entries. Picking the corresponding K rows of ©^ to form the truncated matrix we have 
© v{m e = 0, which shows that these A" rows are linearly dependent, thus violating the design of the 
precoder © an . With D c = diag(© van e) having at least (L 1 1) nonzero entries, the matrix D C V has 
full rank because any L + 1 rows of V are linearly independent. Thus, the maximum diversity gain is 
achieved with redundant precoding irrespective of the underlying constellation. 

When / € [K f K + L], constellation irrespective precoders arc impossible because ©e can not have 
L + 1 nonzero entries for any e that is unconstrained. Therefore, constellation independent precoders 

are not possible for J < K + L. However, with some redundancy J > K, the design 
of constellation-dependent precoders may become easier. 
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D. Affine Precoded CP-only 

Another interesting class of linear precoders implements an affine transformation: s(i) m ©d(t) + 
&h{i)> where b(*) is a known symbol vector. In this paper, we are only interested in the special form 
of: m , 



where the precoder 0 = Ti is the first K columns of I/, the precodcr ©' = is the last L columns 
of I; , and the known symbol vector b has si2e Lxl with entries drawn from the same alphabet A We 
henceforth term the transmission format in (24) as AP-CP-only. Notice that in this scheme, J — K + L 
andP- J+L. 

Although here we place b(i) at the bottom of for convenience, we could also place b(*) at arbitrary 
positions within s(i). As long as L consecutive symbols are known in all decoding schemes detailed 
in Section I i are applicable. 

Recall that the error matrix D c = diag(F/Tie) does not contain known symbols. Since F/Tj. is a 
Vandermonde matrix of the form (23), the maximum diversity gain is achieved, as discussed in Section l- 
C for redundant LP-CP-only. 

In the CP-based schemes depicted in Fig. 2, the CP portion of the transmitted sequence is generally 
unknown, because it is replicated from the unknown data blocks. However, with AP-CP-only in (24), 
and with the specific choice of P « Pf\ we have Pf >s(i) = [[Pfd(i)\ r , [Pfb(i)lT. which 
implies thai both the data block and the known symbol block are time reversed, but keep their original 
positions. The last L entries of Pj^s(0 are known, and are then replicated as cyclic prefixes. For 
this special case, we depict the transmitted sequences in Fig. 3. In this format, the data block d(t) is 
surrounded by two known blocks, that correspond to the pre-amble and post-amble . Our general 
design based on the CP structure includes this known pre- and post- ambles as a special case. 

Notice that the pre-amble and post-amble have not been properly designed in some 
conventional systems. The consequence is that "edge effects" appear for 
transmissions with finite block length, and an approximation on the order of O 
(L/J) has to be made in order to apply Viterbi's decoding algorithm. This 
approximation amounts to nothing but the fact that a linear convolution can be 
approximated by a circular convolution when the block size is much larger than the 
channel order. By simply enforcing a CP structure to obtain circulant convolutions, 
Viterbi's algorithm can be applied to our proposed AP-CP-only with no 
approximation whatsoever, regardless of the block length and the channel order, as 
will be clear soon. 



d(i) 
«0 



(24) 
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E. ZP-only 

Suppose now that in AP-CP-only, we let b(<) = 0 instead of having known symb Is drawn from the 
constellation alphabet, and we fix P = PfK Now, the adjacent data blocks are guarded by two zero 
blocks, each having length L, as depicted in Fig. 3. Since the channel has only order l t presence of 
2L zeros in the middle of two adjacent data blocks is not necessary. Keeping only a single block of 
L zeros corresponds to removing the CP-insertion operation at the transmitter. On the other hand, one 
could view that the zero block in the previous block serves as the CP for the current block, and thus all 
derivations done for CP-based transmission are still valid. The resulting transmission format is shown in 
Fig. 4, which achieves higher bandwidth efficiency than AP-CP-only. We term this scheme as ZP-only, 
where / = K + L and = 

By mathematically viewing ZP-only as a special case of AP-CP-only with b(i) = 0, it is clear that the 
maximum diversity is achieved. In addition to the rate improvement, ZP-only also saves the transmitted 
power occupied by CP and known symbols. 

For convenience, we list all aforementioned schemes in Table 1, assuming a rich scattering environ- 
ment. Power loss induced by the cyclic prefix and the known symbols, is also considered. It certainly 
becomes negligible when K > L. 

F. Links with multicarrier transmissions 

In this section, we link single carrier with digital multicarrier (OFDM based) schemes. We first exam- 
ine the transmitted blocks on two consecutive time intervals. For LP-CP-only, the transmitted space-time 
matrix is: 

m(S) ui(» + l) 
ug(») u 2 (2t + l) 

If let P = P^> and © — Ff9, we obtain for a general matrix 

"ui(2t) ui(3f + l) 
U2(2») ua(2i + l) 



r,p0d(2t) -T^PO'd* (2t + 1) 



time 



J. space 



(25) 



T cp Fj<*d(2t) -T^FjFffM'CB + l) 
T C|) F?*d(2t + i) T cp Fjf*'d*(2») 



-+ time 
4- space 



(26) 
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TABLE I 

Summary of single carrier schemes in rich-scattering environments 





Rate it 


Diversity Gu 


Power Loss (dB) 


Features 


CP-only 




2 


lOlog^ 


constant modulus (C-M)' 


non-redundant 
LP-CP-only 


t£ \ oc > Ml 




10 log.,. 


constellation-specific prccodcr 
constant modulus 


redundant 
LP-CP-only 




2(L + 1) 




constellation-independent 
Not C-M in general 


AP-CP-only 




2(L + l) 




constellation-independent 
constant modulus 


ZP-only 




2(L+1) 


0 


constel lation-independent 
C-M except zero guards 



- only if informaiion symbols have constant-modulus, e.g., drawn from PSK constellations. 



If Y = I K > then (26) corresponds to the space-time block coded OFDM proposed 
in Y.Li,J.C.Chung,andN.R.Sollenberger,"Transmitter diversity for OFDM systems 
and its impact on high-rate data wireless networks," IEEE Journal on Selected 
Areas in Communications, vol. 17, no.7, pp.1233-1243, Julyl999. Designing ¥ * I 
k introduces linear precoding across OFDM subcarriers, as proposed in other 
conventional techniques. Therefore, LP- CP-only includes linear precoded space- 
time OFDM as a special case by selecting the precoder Oand the permutation P 
appropriately. Although linear precoding has been proposed for space time OFDM 
systems, the diversity analysis has not been provided. The link we introduce here 
reveals that the maximum diversity gain is also achieved by linearly precoded ST- 
OFDM with the Vandermonde precoders. 

Interestingly, linearly precoded OFDM can even be converted to zero padded transmissions. Indeed, 
choosing * to be the first K columns of , we obtain the transmitted block as: u(t) = TcpFp¥d(0 «= 
[^Lu^ 7 (0^Li] 7 » wn * cn ' nscrts 2cros b° tn at lne t0 P anc * at lne b° ttom °f each data Mock. 
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G. Capacity Result 

We now analyze the capacity of the space time block coding format of (1). The equivalent channel 
input-output relationship, after receiver processing, is described by (1 3) as: z = VuFj e + *1> where we 
drop the block index for brevity. Let Z(z; s) denote the mutual information between z and s, and recall 
that J(z; s) is maximized when s is Gaussian distribute. Due to the lack of channel knowledge 
at the transmitter, the transmission power is equally distributed among symbols, with R, » E(ss*} = 
o;Iy. Taking into account the CP of length L y the channel capacity, for a fixed channel realization, is 
thus: 



Define E g — 2cr; as the total transmitted power from two antennas per channel use. As the block size / 
increases, we obtain 



The capacity for frequency selective channels with multiple transmit and receive 
antennas has been described with conventional techniques. The result in (28) 
coincides with that of some of these techniques when we have two transmit 
antennas and one receive antenna. Therefore, our proposed transmission format in 
(1) does not incur capacity loss in this special case. This is consistent with 
techniques where the Alamouti coding is shown to achieve capacity for frequency- 
flat fading channels with such an antenna configuration. To achieve capacity for 
systems with two transmit antennas and a single receive antenna, it thus suffices to 
deploy suitable one-dimensional channel codes, or scalar codes. 

II. EQUALIZATION AND DECODING 

Let z(i) := z{i) for CP-only, LP-CP-only, ZP-only, and z(t) := z(i)-T> i2 FjT 2 \>{i) for AP-CP-only. 
With this convention, we can unify the equivalent system output after the linear receiver processing as: 



where A Fj®> the noise rf{i) is white with covariance oj*I/ t and the corresponding 0 is defined as 
in Section I. 





= F,0d(i) + tf (i) = Ad(i) + 



(29) 
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Brute-force ML decoding applied to (29) requires |*4| A * enumerations, which becomes certainly pro- 
hibitive as the constellation size |*4| and/or the block length K increases. A relatively faster near-ML 
search is possible with the sphere decoding (SD) algorithm, which only searches for vectors that are 
within a sphere centered at the received symbols . The theoretical complexity of SD is polynomial 
in K t which is lower than exponential, but still too high for K > 16. Only when the block size K is 
small, the SD equalizer can be adopted lo achieve near-ML performance at a manageable complexity. 
The unique feature of SD is that the complexity does not depend on the constellation size. Thus, SD is 
suitable for systems with small block size K 9 but with large signal constellations. 

We now turn our attention to low-complexity equalizers by trading off performance with complexity. 
Linear zero forcing (ZF) and minimum mean square error (MMSE) block equalizers certainly offer low 
complexity alternatives. The block MMSE equalizer is: 



where we have assumed that the symbol vectors are white with covariancc matrix R» = E{s(i)s H (i)} = 
<t;Ik> The MMSE equalizer reduces to the ZF equalizer by setting cr* = 0 in (30). 
For non-redundant LP-CP-only with 0 = A(ct), we further simplify (30) to 



(30) 



(3D 
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A. ML decoding for AP-CP-only and ZP-only 
For AP-CP-only and ZP-only, wc have 

where we drop the block index i for simplicity. Distinct from other systems, AP-CP-only and ZP-only 
assure that s has the last L entries known, and the first K entries drawn from the finite alphabet A 
In the presence of white noise, ML decoding can be expressed as: 

S Mi . m arg max hi P(z|s) = argmax - ^ w F/s|| 2 /i\y . (33) 
We next simplify (33), starting with 

- X>uF/s|| 2 = 2Re{s«Fj £ X> l , 2 } - sl^Pf^s - z H z 

2 (34) 

where r ;= tfP&. We let r (X ;= [r] tt , and t> n ;= Recognizing that H,<s expresses nothing but a 
circular convolution between the channel h and s, we have [H^s] rt = £jLoM9*QL-9 mod/- Hence, we 

Sml = argmax £ | ^ f 2 Re{^r IS } - g | ELo /P j J ■ ( 35 > 

For each it = 0, 1, . . . , /, let us define a sequence of state vectors as: f a = i)mod /* • ■ ■ > *(n-*)n»<i /] 7 . 
out of which the first and the last states are known: < g = C/ = l*/-i> . • • , */-jJ P ' Tne symbol sequence 
$q, . . . i determines an unique path evolving from die known initial state C 0 to the known final state 
O Thus, Viterbi's algorithm is applicable. Specifically, we have: 

j-i 

$ml = argmax^/(C ft ,C A+ i). ( 36 ) 



where /« n ,< u +i) is tne brancn metric, tnat is readi 'y obtainable from (35). The explicit recursion 
formula for Viterbt's Algorithm is well known. 

We now simplify the branch metric further. We first have \\%sf = s K £^(H*H,> The 
matrix H := Z%i$#%) has &V/) th enlrv : 

i»w = EE^-'NW - «> mod/) * (37) 

pal ;M> 
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Lei us now select J>2L, and define 

z i 

A - /£(0M» + 0, Tor » = Q, (38) 

It can be easily verified that the first column of ft is Jfi^fc, . - - 0, . . . ,0, ft, . . - , ft] r . Let ft denote 
the circulant matrix with first column [(l/2)ft>, A,... ,&»(>,..., 0| r . Because ft is circulant and Her- 
mitian, ft can be decomposed into: ft « ft + ft* We thus obtain s K fts = 2Rc{s*fts}. Recognizing 
[fts] u = (1/2)&* U + mod/, and combining with (35), we obtain a simplified metric as: 



The branch metric in (39) has a format analogous to the one proposed by 
Ungerboeck for maximum- likelihood sequence estimation (MLSE) receivers with 
single antenna serial transmissions. For multi-antenna block coded transmissions, 
a similar metric has been suggested in conventional systems. The systems, 
however, can suffer from "edge effects" for transmissions with finite block length, 
resulting an approximation on the order of O (L/J), while our derivation here is 
exact. Our CP based design assures a circular convolution, while the linear 
convolution in some conventional systems approximates well a circulant 
convolution only when J » L. Note also that we allow for an arbitrary 
permutation matrix P, which includes the time-reversal in as a special case. 
Furthermore, a known symbol vector b can be placed in an arbitrary position within 
the vector s for AP-CP-only. If the known symbols occupy positions B - L, . . . , B 
- 1, we just need to redefine the states as 

Notice that for channels with order L, the complexity of Viterbi's algorithm is O ( 
|A| L ) per symbol; thus, ML decoding with our exact application of Viterbi's 
algorithm should be particularly attractive for transmissions with small 
constellation size, over relatively short channels. 
B. Turbo equalization for coded AP-CP-only and ZP-only 

So far, we have only considered uncoded systems, and established that full 
diversity is achieved. To further improve system performance by enhancing also 
coding gains, conventional channel coding can be applied to our systems. For 
example, outer convolutional codes can be used in AP-CP-only and ZP-only, as 
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depicted in FIG. 5. Other codes such as TCM and turbo codes are applicable as 
well 

In the presence of frequency selective channels, iterative (turbo) 
equalization is known to enhance system performance, at least for single antenna 
transmissions. We here derive turbo equalizers for our coded AP-CP-only and ZP- 

only multi-antenna systems. 

To enable turbo equalization, one needs (o find the a posteriori probability on the transmitted symbols 
based on the received vector z. Suppose each constellation point $ tt is determined by Q = log 2 \A\ 

bits [cnfi, . . . ,Cn,<?-i}. Let us consider the log likelihood ratio (LLR): 

^^ w C ^" + ln ' *€ft/-Hi </€[0,Q-l]. (40) 

The log-likelihood ratio in (40) can be obtained by running two generalized 
Viterbi recursions: one in the forward direction and one in the backward direction 
period. Our branch metric is modified as follows: 

<K<«,< rt+ i) - + la' > (C„til<J. (41) 

This modification is needed to take into account the a priori probability /*(C«+ilC*)» determined by 
the extrinsic information from the convolutional channel decoders during the turbo iteration. When the 
transition from C rt to £ aH is caused by the input symbol $ ni we have Uir(f ft+l |Cj = UiF(^). We 
assume that the bit interleaver in Fig. 5 renders the symbols s a independent and equal likely, such that 
iu =s lui^c**)* which in turn can be determined by the LLRs for bits {<^}£Tq * 

Finally, we remark that one could also adopt the known turbo decoding 
algorithm that is based onMMSE equalizers. This iterative receiver is applicable 
not only to AP-CP-only and ZP-only, but also to CP-only and LP-CP-only systems. 
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C. Receiver Complexity 

Omitting the complexity of permutation and diagonal matrix multiplication, the linear processing to 
reach ( 1 3) only requires one size-/ FFT per block, which amounts to 0(lo$ z J) per information symbol. 

Channel equalization is then performed based on (13) for each block. We notice that the complexity is 
the same as the equalization complexity for single antenna block transmissions over FIR channels [43]. 
We refer the readers to (43] for detailed complexity comparisons of the different equalization options. 
For coded AP«CP-only and ZP-only, the complexity of turbo equalization is again the same as that of 
single antenna transmissions [13]. 

In summary, the overall receiver complexity for the two transmit antenna case is comparable to that of 
single antenna transmissions, with only one additional FFT per data block. This nice property originates 
from the orthogonal space-time block code design, that enables linear ML processing to collect antenna 
diversity. Depending desirable/affordable diversity-complexity tradeoffs, the designer is then provided 
with the flexibility to collect extra multipalh-diversity gains. 
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III. EXTENSION TO MULTIPLE ANTENNAS 



In Section I, we focused on N 



v r s = 2 transmit- and A r r = 1 receive- antennae. In this section, we 



will extend our system design to the general case with ;V t > 2 and/or >V r > 1 antennas. For each 
ft S3 1, . . . , A r j and v = 1, . . . , iV P , wc denote the channel between the /ith transmit- and (he j/th receive- 
antennac as hp„ [1^(0), . . . , /^(J*)] 7 . and 88 before we model it as a zero-mean, complex Gaussian 
vector with covariance matrix R*,,,,,. Correspondingly, we define the efFcctive channel order := 
rank{R* t , fl ,} - 1, which for a sufficiently rich scattering environment becomes =* L. 

Transmit diversity with ;V { > 2 has been addressed in for OFDM based multicarrier transmis- 
sions over FIR channels by applying the orthogonal ST block codes of on each OFDM subcarrier. 
Here, we extend the orthogonal designs to single carrier block transmissions over frequency selective 
channels. 

We will review briefly generalized orthogonal designs to introduce notation, starting with ihe basic 
definitions given in the context of frequency-flat channels : 

Definition 1 : Define x ;» [x x , . . . , xx,\ r < and let £ r (x) be an x A r { matrix with entries 0, ±a? lt . . 
±#A', • If (x){7 r (x) = cc(rf + • • ■ + *A',)I,v t with cc positive, then {r r (x) is termed a generalized real 
orthogonal design (GROD) in variables jf if . . . t x N$ of size N d x N % and rate R=*Nj ty. 
Definition 2: Define x :=* [*i,...,v < vj r s and let £ c (x) be an N d x iV t matrix with entries 0, ±x l9 
±x\, .... ±*jy„ ±apjr t . If £*(x)&(x) - aQxtf + • + kYj 2 )I,v, with a positive, then &(x) is 
termed a generalized real orthogonal design (GCOD) in variables x\ . * f x*? $ of size N* x N t and rate 
R « A r */'Nt- 

Explicit construction of £ r (x) with it = 1 was discussed in [34], where it was also proved that the 
highest rate for £ c (x) is 1/2 when i\ r f > 4. When N t = 3,4, there exist some sporadic codes with rate 
R ss 3/4. Although the orthogonal designs with R = 3/4 for =3 , 4 have been incorporated 
for multicarrier transmissions, we will not consider them in our single carrier block transmissions here; 
we will only consider the R = 1/2 GCOD designs primarily because GCOD &(x) of R = 1/2 can be 
constructed using the following steps (N, — 4 for >V t = 3,4, while N 9 » 8 for iVf = 5, 6, 7, § (34)): 

si) construct GROD £ r (x) of size N, x N t with Jt s 1; 

s2) replace the symbols » ir . . . ,3?^ in £ r (x) by their conjugates . . . io arrive at ? r (^); 



As will be clear soon, we are explicitly taking into account the fact that all symbols from the upper- 
part of &(x) are un-conjugatcd, while all symbols from the lower-part are conjugated. The rate loss can 
be as high as 50%, when JV^ > 2. 

With iV t > 2, the space-time mapper takes A> consecutive blocks to output the following N t J x 
space time coded matrix (N d = 2N f ) 



s3)formg c (x) = [^(x) ? ^V)] r - 



Si(^V a + iV rf -l) 



time. 



g(r) = f A',), . . . , s(«V, + AT, - 1)} - 



(42) 



i space 
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The design steps are summarized as follows: 

dl) construct & of size 2iV. x JVJ in the variables x u . . . f s# §9 as in sl>s3); 
d2) Replace xi> . . . , x\ Y . in £?' by s(f ;Y # ), . . . , s(kV. + /V P - 1); 

d3) Replace in gj by Ps m (iN 0 ) t . ..,Ps'(wV„ + A r , - 1), where P is taken properly for 

different schemes as explained in Section 1. 

At each block transmission slot i> S,,(i) is forwarded to the /Uh antenna, and transmitted through the 
FIR channel after CP insertion. Each receive antenna processes blocks independently as follows: The 
receiver removes the CP, and collects N d = 2i\ f # blocks x(tA r *),. . . ,x(fi>b + Nk - X). Then FFr is 
performed on the first A r # blocks x(tjV<<),. • • ,x(ftjVtf + A r , - 1), while permutation and conjugation is 
applied to the last N, blocks: Px'(tjY u + A r ,), . . . , Px*(iAT u + A r tf - 1), followed by FFT processing. 
Coherently combining the FFT outputs as we did for the two antennae case to derive (13), we obtain on 
each antenna the equivalent output after the optimal linear processing: 

where P„ ;~ \£* x P^*VJ 1/2 and ;« diag(V) - diag(VV)- 

We next stack the z„(i) blocks to form 2(t) = [sftO,.. i2^ r (01 r (likewise for »y(i)), and define 
B [Pi,.. .,P^l r , to obtain: S(0 = BF,s(i) + Defining B := Z^^k*VP. 
we have B*B = B 2 . Therefore, we can construct a matrix U fr BB~ l , which has orthonormal 
columns U^U fr n I/, and satisfies U^B » B. As and B share range spaces, multiplying Ujp by 
z(i) incurs no loss of optimality, and leads to the following equivalent block: 

z{{) :« U«2(t) = BF,s(i) + 7(0, (44) 

where the noise rf{i) is still while. Now the distance between z and z 7 , corresponding to two different 
symbol blocks d and d\ becomes: 

^(z, Z ')=f;f;iiD c vh I< ^. (45) 

Comparing (45) with (15), the contribution now comes from N t N r multipath 
channels. Following the same steps as in Section I, the following result can be 
established : 

Proposition 1 : The maximum achievable diversity order is J2*U + Wim ^ transmit- and 

N T receive- antennas, which equals J\ r t N r (L -h 1) when the channel convlation has full rank, 

1. CP-only achieves multi-antenna diversity of order N t N r : 

2. LP-CP-only achieves the maximum diversity gain through either non-redundant hut constellation- 
dependent, or, tvdtmdant hut constellation-independent preaxling; 

j. Afftne precoded CP-only and ZP-only achieve the maximum diversity gain irrespective of the underly- 
ing signal constellation. 
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The linear ML processing t reach (44) requires a total of tf d N T = VS r *N r FFTs corresponding lo 
each space-time coded block of (42), which amounts to 2N r FFTs per information block. Channel 
equalization based on (44) incurs identical complexity as in single antenna transmissions. For AP-CP- 
only and ZP-only, the ML estimate & Mt = argmax (-||z - 8Fs|| 2 /A&) can be obtained via exact 
application of Vitcrbt's algorithm. Relative to the two antenna case detailed in Section ll-A, we can 
basically use the same expression for the branch metric of (39), with two modifications, namely: r tt = 
[r|a withr = F*Bs, and 

ftal ts*=.\ <«0 

We summarize the general complexity results of this section and those of Section 
II in the following. 

Proposition 2: The proposed space-time block coded CP-only, LP-CP-only. AP-CP-only and ZP-only 
systems with AT t > 2 (TV* = 2) transmit- and N r receive- antennas require an additional complexity of 
0(2N r lo& 2 J) (respectively Q(N r loffoJ)) per information symbol, relative to their counterparts with 
single transmit- and single receive- antenna, where J is the FFT size. 
IV. SIMULATED PERFORMANCE 

In this section, we present simulation results for systems with two transmit- and 
one receive- antenna. For ease in FFT processing, we always choose the block size 
J to be a power of 2. In all figures, we define SNR as the average received symbol 
energy to noise ratio at the receive antenna. For reference, we also depict the 
(outage) probability that the channel capacity is less than the desired rate, so that 
reliable communication at this rate is impossible. Specifically, we calculate (28) 
numerically, we evaluate the outage probability at the targeted rate R as 
P(Cj~ rQ v < R) with Monte-Carlo simulations. 

Test Case 1 (comparisons for different equalizers): We first set L = 2, and assume 
that the channels between each transmit and each receive antenna are i.i.d., 
Gaussian, with covariance matrix I L +i/(L+l). We investigate the performance of 
ZP-only with block sizes: K = 14,and P = J - 16. We adopt QPSK constellations. 
FIG. 6 depicts the block error rate performance corresponding to MMSE, DFE, SD, 
and ML equalizers. We observe that the SD equalizer indeed achieves near-ML 
performance, and outperforms the suboptimal block DFE as well as the block 
MMSE alternatives. Without channel coding, the performance of ZP-only is 
faraway from the outage probability at rate 2K/(K + L) =1.75 bits per channel use. 
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Test Case 2 (convolutionally coded ZP-only): We here use two i.i.d. taps per FIR 
channel, i.e., L = 1. We set the block sizes as K = 127, P = J = 128 for our ZP-only 
system, and use 8-PSK constellation. For convenience, we view each block of 
length P = 128 as one data frame, with the space time codes applied to two adjacent 
frames. Within each frame, the information bits are convolutionally coded (CC) 
with a 16-state rate 2/3 encoder. Omitting the trailing bits to terminate the CC 
trellis, and ignoring the rate loss induced by the CP since L « K, we obtain a 
transmission rate of 2 bits per channel use. 

Turbo decoding iterations are performed. With the 16-state convolutional 
code, the frame error rate for ZP-only is within 2.3dB away from the outage 
probability. 

Test Case 3 (convolutionally coded AP-CP-only over EDGE channels): We 
test the Typical Urban (TU) channel with a linearized GMSK transmit pulse shape, 
and a symbol duration T = 3.69 ^s as in the proposed third generation TDMA 
cellular standard EDGE (Enhance Date Rates for GSM Evolution). The channel 
has order L = 3 and correlated taps. We use QPSK constellations, and set the block 
size J = 128. We adopt AP-CP-only that guarantees perfectly constant modulus 
transmissions. Within each frame of 128 symbols, the last 3 are known. 
Information bits are coded using a 16-state rate 1/2 convolutional code. Taking into 
account the known symbols, the cyclic prefix, and zero bits to terminate the CC 
trellis, the overall transmission rate of the proposed AP-CP-only is (128 - 3 - 
4)/(128 + 3) = 0.924 bits per channel use, or 250.4 kbps. 

As shown in FIG. 8, the system with two transmit antennas significantly 
outperforms its counterpart with one transmit antenna. At frame error rate of 1 0" 2 , 
about 5dB SNR gain has been achieved. FIG. 9 depicts the performance 
improvement with turbo iterations, which confirms the importance of iterative over 
non-iterative receivers. A large portion of the performance gain is achieved within 
three iterations. 

Various embodiments of the invention have been described. These and 
other embodiments are within the scope of the following claims. 
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CLAIMS: 

1 . A method comprising: 

parsing a stream of information-bearing symbols to form blocks of K 
symbols; 

precoding the symbols to form blocks having J symbols; 

collecting consecutive N s blocks; 

applying a permutation matrix to the N s blocks; 

generating a space-time block coded matrix having N t rows, each row 
containing Np*J symbols; 

generating N t transmission signals from the symbols of the N t rows; and 

within N D block transmission time intervals, communicating the N t 
transmission signals through N t transmit antennas over a wireless communication 
medium. 

2. The method of claim 1 , wherein J > K. 

3. The method of claim 1 , wherein J = K. 

4. The method of claim 1 , wherein N t =2 and N D =2, and applying the 
permutation matrix comprises applying the permutation matrix to generate the 
space-time coded matrix according to the following equation: 

s(2£) -Ps*(2i + 1) 
s(2* + l) Ps'(2*) 

where P represents a permutation matrix, i represents an index into the blocks of 
symbols, and s represents symbol block. 

5. The method of claim 4, wherein the permutation matrix is drawn from a set 
of permutation matrices {P/^}n«o-- 
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6. The method of claim 4, wherein each row of a second column of the space- 
time block coded matrix stores a block that is a conjugated and permuted version 
of a corresponding block from an other row of a first column. 

7. The method of claim 1 , wherein precoding the symbols comprises adding a 
set of known symbols to each group of K symbols. 

8. The method of claim 7, wherein the set of known symbols comprises a 
preamble and a post amble. 

9. The method of claim 1 , further comprising: 

receiving a signal from the wireless communication medium, wherein the 
signal comprises a stream of received symbols; 

parsing the received symbols of the input signal to form blocks of J 
symbols; 

applying the permutation matrix to the blocks of the received symbols; and 
separately demodulating transmitted data from the permutated blocks of 
received symbols. 

10. The method of claim 9, further comprising conjugating and applying a Fast 
Fourier Transform (FFT) to the blocks. 

11. An apparatus comprising: 

an encoder to apply a permutation matrix to blocks of information bearing 
symbols and to generate a space-time block coded matrix of the permutated blocks 
of symbols; 

a plurality of pulse shaping units to generate a plurality of transmission 
signals from the symbols of the space-time block coded matrix; and 

a plurality of antennae to communicate the transmission signals through a 
wireless communication medium. 
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1 2. The apparatus of claim 1 1, wherein the encoder collects consecutive N s 
blocks within a buffer, applies a permutation matrix to the N s blocks, and forms a 
space-time block coded matrix having N ( rows of symbols. 

1 3 . The apparatus of claim 1 1 , further comprising a precoder to 
precode the symbols to form blocks having J symbols, wherein each row of the 
space-time block coded matrix contains Nd*J symbols, wherein N D represents a 
number of block transmission time intervals for transmitting the space-time matrix. 

14. The apparatus of claim 11, wherein the precoder adds a set of known 
symbols to each group of K symbols. 

1 5. The apparatus of claim 14, wherein the set of known symbols comprises a 
preamble and a post amble. 

1 6. The apparatus of claim 1 1 , wherein J > K. 

1 7. The apparatus of claim 1 1 , wherein J = K. 

18. The apparatus of claim 1 1 , wherein N t =2 and the encoder applies the 
permutation matrix to generate the space-time coded matrix according to the 
following equation: 



where P represents a permutation matrix, i represents an index into the blocks of 
symbols, and s represents symbol block. 

1 9. The apparatus of claim 13, wherein the permutation matrix is drawn from a 



5(20 
s(2s + l) 



Ps*(2t + 1) 
Ps'(2i) 



set of permutation matrices 
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20. The apparatus of claim 1 1 , wherein the apparatus comprises a base station 
within a wireless communication system. 

2 1 . The apparatus of claim 1 1 , wherein the apparatus comprises one of a 
cellular phone, a personal digital assistant, a laptop computer, a desktop computer, 
a two-way communication device. 

22. A method comprising: 

applying a permutation matrix to blocks of symbols of an outbound data 

stream; 

generating transmission signals from the permutated blocks of symbols; 

and 

communicating the transmission signals through a wireless communication 
medium. 

23. The method of claim 22, further comprising generating a space-time block 
coded matrix having N t rows, wherein N t represents a number of transmitters 
within a transmission device. 

24. The method of claim 23, further comprising: 

parsing the outbound data stream of information-bearing symbols to form 
blocks of K symbols; 

precoding the symbols to form blocks having J symbols; 
collecting consecutive N s blocks; and 

generating the space-time block coded matrix to have N t rows and No* J 
symbols per row, wherein Nd represents a number of transmission time intervals 
for communicating the transmission signals. 



29 



WO 02/098051 



PCT/US02/16897 




WO 02/098051 



PCT/US02/16897 



2/5 



Tx2 





^ -[PsCSi-H)!* j 


: CP 




CP 




il 






il 




-s the (2i)lh block a* 


■« the (2i+l)st block — » 



FIG. 2 



prc-atnble post-amble 













Txl 


bf 






«? 


_[pP*CH+1)J* 


b? 




CP 


b,,;=b(2i) 
b*:=b(2i+-l) 




CP 


&z : = P^ba 




Tx2 




[d!(2i+l)f 


b£ 


6* 




b? 




^ 


the (2i)th block — 


3s* 


«s the(2i+l)st block — *■ 



FIG. 3 



WO 02/098051 



3/5 



PCT/US02/16897 




WO 02/098051 PCT/US02/16897 




: : : . . . ^ . . 

H i i i i i i i_J> 1 1 1 

6 7 8 9 10 It 12 13 \4 15 16 

SNR (dB) 



FIG. 6 




SNR (dB) 



FIG. 7 



WO 02/098051 PCTVUS02/16897 



5/5 





SNR (dB) 



FIG. 9 



INTERNATIONAL SEARCH REPORT 


IjAnatlonal Application No 




TCT/US 02/16897 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 7 H04L1/06 



According to International Paieni Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system foDowed by classitication symbols) 

IPC 7 H04L 



Documentation searched other than minimum documentation to the extent that such documents are Included in the tields searched 



Electronic data base consulted during the international search (name ol data base and, where practical, search terms used) 

INSPEC, COMPENDEX 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



LIU Z ET AL: "Space-time coding for 
broadband wireless communications" 
WIRELESS COMMUNICATIONS AND MOBILE 
COMPUTING, JAN. -MARCH 2001, WILEY, UK, 
vol. 1, no. 1, pages 35-53, XP002210873 
ISSN: 1530-8669 
page 37, paragraph 2.1 
page 39, paragraph 2.3.2 
page 41, paragraph 3.2 -page 44 
figure 5 

-/-- 



1-24 



13 



Further documents are listed in the continuation ol box C. 



□ 



Patent family members are listed in annex. 



• Special categories of cited documents : 

*A* document defining the general state of the art which is not 

considered to be of particular relevance 
'E' earlier document but published on or after the international 

filing date 

'L* document which may throw doubts on priority claim(s) or 
which Is cited to establish the publication date of another 
citation or other special reason (as specified) 

*0* document referring to an oral disclosure, use, exhibition or 
other means 

'P* document published prior to the international filing date but 
later than the priority date claimed 



*T" later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

•X 1 document of particular relevance; the claimed Invention 
cannot be considered novel or cannot be considered to 
Involve an inventive step when the document is taken alone 

'Y* document of particular relevance; the claimed invention 
cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

*&' document member of the same patent family 



Date of the actual completion of the international search 

23 August 2002 


Date of mailing of the international search report 

02/10/2002 


Name and mailing address of the ISA 

European Patent Office. P.B. 5818 Patent laan 2 
NL-2280HV Rijswijk 
TeL (+31-70) 340-2040. Tx. 31 651 epo nl. 
Fax: (+31-70) 340-3016 


Authorized officer 

Schlffer, A 



Form PCT/tSA/210 (second sheet) (July 1992) 



page 1 of 2 



INTERNATIONAL, SEARCH REPORT l ^ 18tlonal Appllcotlon No 

PCT/US 02/16897 


C.(Contlnuatlon) OOCUMENTS CONSIDERED TO BE RELEVANT 


Category • 


Citation o» document, with Indicatlon.where appropriate, ot the relevant passages 


Relevant to claim No. 


A 


ZHEN6DA0 WANG ET AL: "Linearly precoded 
or coded OFDM against wireless channel 
fades?" 

2001 IEEE THIRD WORKSHOP ON SIGNAL 
PROCESSING ADVANCES IN WIRELESS 
COMMUNICATIONS (SPAWC'Ol). WORKSHOP 
PROCEEDINGS (CAT. NO.01EX471), PROCEEDINGS 
OF SPAWC-20001. THIRD IEEE SIGNAL 
PROCESSING WORKSHOP ON SIGNAL PROCESSING 
ADVANCES IN WIRELESS COMMUNIC, 

pages 267-270, XP002210874 
2001, Piscataway, NJ, USA, IEEE, USA 
ISBN: 0-7803-6720-0 
the whole document 


1-24 



Form PCT/1SA/210 (continuation ot second shoe!) (Jury 1992) 



page 2 of 2 



